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Description 

[0001] This invention clainns the benefit of U.S. Provisional Application No. 60/068,658. filed Decennber 23, 1997. 
[0002J This invention relates to reconnbinant DNA technology. In particular the invention pertains to a fungal glucan 

s synthase, and to a sub-region thereof thai mediates echinocandin binding and antifungal activity Also contemplated 
is the use of said echinocandin binding region in screens for compounds that bind glucan synthase. 
[0003] The incidence of life-threatening fungal infections is increasing at an alarming rale. About 90% of nosocomial 
fungal infections are caused by species of Candida, with the remaining 10% being attributable \o Aspergillus, Crypto- 
coccus, and Pneumocystis. While effective antifungal compounds have been developed for Candida, there is growing 

10 concern over escalating resistance in other pathogenic fungi. Since anti-Candida compounds rarely are clinically ef- 
fective against other fungi, new compounds are needed for effective antifunal therapy 

[0004] The present Invention provides an echinochandin binding domain of a fungal 1 ,3,p-glucan synthase (herein- 
after " glucan synthase" )that Is useful in identifying compounds that bind and Inhibit glucan synthase activity The 
compositions of this invention enable identification of new and better antifungal compounds. 
15 [0005] In one embodiment the present invention relates to a nucleic acid molecule that encodes an echinocandin 
binding domain of fungal glucan synthase. 

[0006] In another embodiment the present invention relates to a peptide that comprises an echinocandin binding site 
of fungal glucan synthase. 

[0007] In another embodiment, the present invention relates to a method for identifying compounds that bind an 

20 echinocandin binding domain of fungal glucan synthase. 

[0008] "ECB binding domain" or " ECB binding site" or "ECB binding fragment" refers to a subregion of the yeast 
glucan synthase molecule (i.e. product of FKS1 gene in S. cerevisiae), wherein said subregion retains, either alone or 
in combination with another protein, for example, as a fusion protein, the capacity to bind echinocandins such as ECB. 
For example. In one embodiment the present Invention relates to a subregion of SEQ ID NO;2 comprising amino acid 

25 residues 583 to 672. ECB binding fragments may be verified by any suitable test for binding to ECB or other echino- 
candin. or papulocandin, or related compounds. 

[0009] The term "fusion protein" denotes a hybrid protein molecule not found in nature comprising a translatlonal 
fusion or enzymatic fusion in which two or more different proteins or fragments thereof are covalently linked on a single 
polypeptide chain. 

30 [0010] The term "plasmid" refers to an extrachronnosomal genetic element. The starting plasmids herein are either 
commercially available, publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accordance with published procedures. In addition, equivalent plasmids to those described are known In the art and 
will be apparent to the ordinarily skilled artisan. 

[0011] "Recombinant DNA cloning vector" as used herein refers to any autonomously replicating agent, including. 
35 but not limited to, plasmids and phages, comprising a DNA molecule to wrfilch one or more additional DNA segments 
can or have been added. 

[0012] The term "recombinant DNA expression vector" as used herein refers to any recombinant DNA cloning vector, 
for example a plasmid or phage, in which a promoter and other regulatory elements are present to enable transcription 
of the Inserted DNA. 

40 [001 3] The term "vector" as used herein refers to a nucleic acid compound used for Introducing exogenous DNA into 
host cells. A vector comprises a nucleotide sequence which may encode one or nrxjre protein molecules. Plasmids, 
cosmids, viruses, and bacteriophages, In the natural state or which have undergone recombinant engineering, are 
examples of comnr>only used vectors. 

[0014] The terms "complementary" or "complementarity" as used herein refers to the capacity of purine and pyrimi- 
^5 dine nucleotides to associate through hydrogen bonding in double stranded nucleic acid molecules. The following base 
pairs are complementary: guanine and cytosine; adenine and thymine; and adenine and uracil. 
[0015] "Isolated nucleic acid compound" refers to any RNA or DNA sequence, however constructed or synthesized, 
which Is locationally distinct from its natural location 

[0016] A "primer" is a nucleic acid fragment which functions as an initiating substrate for enzymatic or synthetic 
50 elongation of. for example, a nucleic acid molecule. 

[0017] The term "promoter" refers to a DNA sequence which directs transcription of DNA to RNA. 

[0018] A "probe" as used herein is a labeled nucleic acid compound which hybridizes with another nucleic acid 

compound. 

[0019] The term "hybrkiization" as used herein refers to a process in which a single-stranded nucleic acid molecule 
55 joins with a complementary strand through nucleotide base pairing. "Selective hybridization" refers to hybridization 
under conditions of high stringency The degree of hybrldlzatton depends upon, for example, the degree of comple- 
mentarity, the stringency of hybridlzatk)n, and the length of hybridizing strands. 

[0020] The term "stringency" refers to hybridization conditions. High stringency conditions disfavor non -homologous 
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basepairing. Low stringency conditions have the opposite effect. Stringency may be altered, for exannple, by temper- 
ature and salt concentration. 

[0021] 'Low stringency" conditions comprise, for example, a temperature of about 37** C or less, a formamide con- 
centration of less than about 50%, and a moderate to low salt (SSC) concentration; or, alternatively, a temperature of 

5 atx)ut 50" C or less, and a moderate to high salt (SSPE) concentration, for example 1 M NaCI. 

[0022] "High stringency" conditions comprise, for example, a temperature of about 42' C or less, a formamide con- 
centration of less than about 20%, and a low salt (SSC) concentration; or, alternatively, a temperature of about 65" c, 
or less, and a low salt (SSPE) concentration. For example, high stringency conditions comprise hybridization in 0.5 M 
NaHP04, 7% sodium dodecyt sulfate (SDS), 1 mf^ EDTA at 65*C (Ausubel, RM. eta! Current Protocols in Molecular 

10 Biology, Vol. I, 1989; Green Inc. New York, at 2.10.3). 

[0023] "SSC" comprises a hybridization and wash solution. A stock 20X SSC solution contains 3M sodium chloride. 
0.3M sodium citrate, pH 7.0. 

[0024] "SSPE" comprises a hybridization and wash solution. A IX SSPE solution contains 180 mM NaCI. 9ml^ 
Na2HP04, 0.9 mlVI NaH2P04 and 1 mM EDTA, pH 7.4. 
^5 [0025] "Substantially pure" used in reference to a peptide or protein means that said peptide or protein is separated 
from a large fraction of all other cellular and non-cellular molecules, including other protein molecules A substantially 
pure preparation would be about at least 85% pure; preferably about at least 95% pure. For example, a "substantially 
pure" protein as described herein could be prepared by the IMAC protein purification method, or any other suitable 
method. 

20 [0026] Cell walls are essential to the viability of fungi, but have no existence in mammalian cells. This makes synthesis 
of the fungal cell wall a useful target for antifungal compounds. Two polysaccharide polymers, chitin and 1 ,3-[J-glucan, 
are essential components of fungal cell walls. Therefore, antibiotics that interfere with the synthesis of these polymers 
are useful in mycosis therapy Polysaccharides have been estimated to account for as much as 80% to 90% of the 
Saccharomyces cerevisiaeceW wall. The major cell wall polymers are glucan and man nan, and small amounts of chitin. 

25 [0027] In S. cerevisiae, cell wall synthesis is thought to involve at least a subunit of glucan synthase, which is encoded 
by the FKS1 gene (Douglas etaL Proc. Nat. Acad. Sci. 91, 12907-911, 1994). FKS1 encodes a 215 kD integral mem- 
brane protein of 1876 amino acid residues that is the likely target of ECB and other echinocandins {Id.) For example, 
resistance to ECB and other echinocandins maps to the FKS 1 kx;us. More specifically, a domain of FKS 1 , which resides 
at amino acid residues 583 to 672 defines a cytoplasmic loop thought to be necessary and sufficient to comprise an 

30 echinocandin binding domain. 

Gene Isolation Procedures 

[0028] Those skilled in the art will recognize that the nucleic acids of this invention may be obtained by a plurality of 
3S applicable genetic and recombinant DNA techniques including, for example, polymerase chain reaction (PGR) ampli- 
fication, or de novoDNA synthesis. {See eg., J.Sambrook et a/. Molecular Cloning , 2d Ed. Chap. 14 (1989)). 
[0029] Skilled artisans will recognize that a nucleic acid encoding the ECB binding domain coukJ be isolated by PGR 
amplification of any suitable genomic DNA or cDNA using oligonucleotide primers targeted to the appropriate region 
of FKS 7 (viz. encoding amino acid residues 587 to 672 of SEQ ID NO:2). The preferred template source for PGR 
40 amplification is S. cerevisiae genomic DNA. Methods for PGR amplification are widely known in the art. See e.g. PGR 
Protocols: A Guide to Method and Application. Ed. M. Innis etaL, Academic Press (1990). The amplification reaction 
comprises genomic DNA, suitable enzymes, primers, and buffers, and is conveniently carried out in a DNA Thermal 
Cycler (Perkin Elmer Getus. Norwalk, GT). A positive result is determined by detecting an appropriately-sized DNA 
fragment following agarose gel electrophoresis. 

45 

Protein Production Methods 

[0030] The present invention also relates to a substantially purified peptide, or fusion protein, comprising a sub- 
region of glucan synthase that functions as an echinocandin binding site. 
so [0031] Skilled artisans will recognize that the proteins and peptides of the present invention can be synthesized by 
any number of different methods including solid phase chemical synthesis or recombinant methods. Both methods are 
described in U.S. Patent 4,617,149, incorporated herein by reference. 

[0032] The principles of solid phase chemical synthesis are well known in the art and may be found in general texts 
in the area. See, e.g., H. Dugas and C. Penney. Bioorganic Chemistry (1981) Springer-Verlag, New York. 54-92. For 
55 example, peptides may be synthesized by solid-phase methodology utilizing an Applied Biosystems 430 A peptide 
synthesizer (Applied Biosystems. Foster City. CA) and synthesis cycles supplied by Applied Bk)systems. Protected 
amino acids, such as t-butoxycarbonyl-protected amino acids, and other reagents are commercially available from 
many chemical supply houses. 
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[0033] The peptide of the present invention can also be produced by recombinant DNA methods using a cloned 
nucleic acid. Recombinant methods are preferred if a high yield of the peptide is desired. Expression of a cloned nucleic 
acid can be carried out in a variety of suitable hosts, well known to those skilled artisan. For example, the cloned DNA 
is introduced into a host coll by any suitable means, well known to those skilled in the art. While chromosomal integration 
5 of the cbned nucleic acid is within the scope of the present invention, it is preferred that it comprise part of a suitable 
extra-chromosomally maintained expression vector 

[0034] The basic steps in the recombinant production of the peptides of this invention are: 

a) constructing a natural, synthetic or semisynthetic DNA encoding said protein, peptide, or fusion protein; 

10 

b) integrating said DNA into an expression vector in a manner suitable for expressing the protein, either alone or 
as a fusion protein; 

c) transforming or otherwise introducing said vector into an appropriate eucaryotic or prokaryotic host cell, forming 
15 a recombinant host cell, 

d) cuUuring said recombinant host cell in a manner to express the protein; and 

e) recovering and substantially purifying the protein by any suitable means. 

20 

Expressing a Recombinant ECB Binding Domain in Procaryotic and Eucaryotic Host Cells 

[0035] In general, procaryotes are used for cloning DNA sequences and for constructing the vectors of the present 
invention. Procaryotes may also be used in the production of the ECB binding peptide. For example, the Escherichia 
25 coli K12 strain 294 (ATCC No. 31446) is particularly useful, for the prokaryotic expression of foreign proteins. Other 
strains of E. coli, bacilli such as Bacillus subtilis, enterobacteriaceae such as Salmonella typhimuriumor Serraiia marc- 
escans, various Pseudomonas species and other bacteria, such as Streptomyces, may also be employed as host cells 
in the cloning and expression of the recombinant proteins of this inventk^n. 

[0036] Promoter sequences suitable for driving the expressran of genes in procaryotes include ^-lactamase [e.g. 

30 vector pGX2907. ATCC 39344, contains a replicon and p-.lactannase gene], lactose systems [Chang et al., Nature 
(London), 275.615 (1 978); Goeddel etal.. Nature (London). 281 :544 (1 979)], alkaline phosphatase, and the tryptophan 
(trp) promoter system [vector pATHI (ATCC 37695) whk:h is designed to facilitate expression of an open reading frame 
as a trpE fusion protein under the control of the trp promoter]. Hybrid promoters such as the tac pronrwter (isolatable 
from plasmid pDR540. ATCC-37282) are also suitable. Still other bacterial promoters, whose nucleotkJe sequences 

35 are generally known, enable one of skill in the art to ligate such promoter sequences to DNA encoding the proteins of 
the instant invention using linkers or adapters to supply any required restrictwn sites. Pronnoters for use in bacterial 
systems also will contain a Shine-Dalgarno sequence operably-linked to the DNA encoding the desired polypeptides. 
These examples are illustrative rather than limiting. 

[0037] The peptides of this invention may be synthesized de novo, or they may be produced as a fusion protein 

40 comprising the peptide of interest (viz. ECB binding fragment) as a translatkxial fusion with another protein or peptide 
that may be renrtovable by enzymatic or chemical cleavage. It is often observed that expression as a fusion protein 
prolongs the lifespan, increases the yield of a desired peptide, and provides a convenient means of purifying the protein. 
A variety of peptidases (e.g. enterokinase and thrombin) which cleave a polypeptide at specific sites or digest the 
peptides from the amino or carboxy termini (e.g. diaminopeptidase) of the peptide chain are known. Furthermore, 

45 particular chemicals {e.g. cyanogen bromide) cleave a polypeptide chain at specific sites. The skilled artisan will ap- 
preciate the modificatbns necessary to the amino acid sequence (and synthetic or semisynthetic coding sequence if 
recombinant means are employed) to incorporate site-specific internal cleavage sites. See e.g., P. Carter, "Site Specific 
Proteolysis of Fuskxi Proteins", Chapter 13, in Protein Purification: From Molecular Mechanisms to Large Scale Proc- 
esses, American Chemical Society, Washington, D.C. (1990). 

so [0038] The present invention contemplates ECB binding fusion proteins comprising a fragment of glucan synthase 
in fusion with another protein, thereby facilitating isolation, purification, and assay of said ECB binding fragment A 
variety of emt>odiments and methods for producing fusion proteins are known in the art and are suitable for the present 
inventbn. For example, foreign proteins may be fused with the carboxy terminus of Sj26, a 26 kDa glutathione S- 
transferase (GST), encoded by the parasitic helminth Schistosoma japonicum. Such fusion proteins may be expressed 

55 in E. coli or other suitable procaryote, or in eucaryotic hosts, such as yeast. In this r gard, the method and vectors of 
Smith and Johnson are especially suitable (Gene, 67, 31-40, 1988), th entire contents of which is incorporated by 
reference. It is desirable that the fusion protein remain in solution to facilitate ease of purification. 
[0039] In addition to procaryotes, a variety of mammalian cell systems and eucaryotic microorganisms such as yeast 
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are suitable host cells for the recombinant expression of proteins or fusion proteins. The yeast Saccharomyces cere- 
visiae is the most commonty used eucaryotic microorganism. A number of otiner yeasts such as Kluyveromyces lactis 
and Schizosaccharomyces pombe are also suitable. For expression in Saccharomyces, the ptasmid YRp7 (ATCC- 
40053), for exannple, maybe used See, e.g., D. Stinchcomb, et al.. Nature, 282:39 (1979); J. Kingsman etai, Gene, 
5 7:141 (1979); S. Tschemper et at., Gene, 10:157 (1980). Plasmid YRp7 contains the TRP1 gene which provides a 
selectable marker for use in a trpi auxotrophic mutant. For expression in S pombe suitable vectors include those 
containing the nmfl promoter as well as the adh promoter and the SV40 promoter (See e.g. S. Forsburg. Nuc. Acid. 
Res, 21. 2955. 1993). 

10 Purification of Recombinantly-Produced ECB Binding Peptide 

[0040] An expression vector comprising a cloned nucleic acid encoding an ECB binding domain is transformed or 
transfected into a suitable host cell using standard methods. Cells that contain the vector are propagated under con- 
ditions suitable for expression of the peptide. If the gene is controlled by an inducible promoter, suitable growth con- 
's ditions should incorporate the appropriate inducer Recombinantly-produced peptide may be purified from cellular ex- 
tracts of transformed cells by any suitable means. In one process for peptide purification, the gene is modified at the 
5' end to incorporate several hislidine residues at the amino terminus of the peptide. This "hislidine tag' enables a 
single-step protein purification method referred to as 'immobilized metal Ion affinity chromatography" (I MAC), essen- 
tially as described in U.S. Patent 4,569,794 which hereby is incorporated by reference. The I MAC method enables 
20 rapid isolation of substantially pure peptide starting from a crude cellular extract. 

[0041] Other embodiments of the present invention comprise isolated nucleic acid sequences that comprise SEQ 
ID NO:2, wherein said sequences encode amino acid residues 583 to 672 of SEQ ID NO:2. As skilled artisans will 
recognize, the amino acid compounds of the invention can be encoded by a nnultitude of different nucleic acid sequences 
because most of the amino acids are encoded by more than one codon due to the degeneracy of the genetic code 
25 Because these alternative nucleic acid sequences would encode the same amino acid sequences, the present invention 
further comprises these alternate nucleic acid sequences. 

[0042] Nucleic acids encoding an ECB binding domain of SEQ ID NO:2 may be produced by synthetic methods. 
Fragments of the proteins disclosed herein may be generated by any nunnber of suitable techniques, including chemical 
synthesis of a suitable portion of SEQ ID N0:2. proteolytic digestion of SEQ I D NO: 2, or most preferably, by recombinant 

30 DNA mutagenesis techniques, wed known to the skilled artisan. See. e.g. K. Struhl, " Reverse biochemistry: Methods 
and applications for synthesizing yeast proteins in vitro,' Meth. Emymol. 194, 520-535. For example, in a preferred 
method, a nested set of deletion mutations are introduced into the intact FKS1 gene (SEQ ID NO:1) encoding the 
native glucan synthase protein, such that varying amounts of the protein coding region are deleted, either from the 
amino terminal end, or from the carboxyl end of the protein molecule, and wherein said deletions produce molecules 

35 that retain amino acid residues from about 605 to 650, or nx>re preferably amino ackJ residues from about 583 to 672 
of SEQ ID N0.2. Internal fragments of the intact protein can also be produced in which both the carboxyl and amino 
terminal ends are removed. Several nucleases can be used to generate deletions, for example Bal 31 . or in the case 
of a single stranded nucleic acid molecule, mung bean nuclease. For simplicity, it is preferred that the intact FKS1 gene 
be cloned into a single-stranded cloning vector, such as bacteriophage Ml 3. or equivalent. If desired, the resulting 

40 gene deletion fragments can be subcloned into any suitable vector for propagation and expression of said fragments 
in any suitable host cell. It is preferred that the fragments be subcloned into a plasmid, for example pGEX-1 (Smith & 
Johnson, Gene, 67, 31 , 1988). enabling the production of a fusion protein comprising an ECB binding domain 
[0043] The present inventbn provides fragments of the intact glucan synthase protein disclosed herein wherein said 
fragments retain the ability to bind ECB or other echinocandin or papulocandin. 

45 [0044] ECB binding fragments of the intact proteins disclosed herein may be produced as described above, preferably 
using cloning techniques to produce fragments of the intact FKS1 gene. Peptide fragments of glucan synthase or fusion 
proteins comprising a peptide fragment of glucan synthase may be tested for binding activity using any suitable assay 
[0045] The synthesis of nucleic acids is well known in the art. See, e.g., E.L. Brown, R. Belagaje, M.J. Ryan, and H. 
G. Khorana, Methods in Enzymology, 68:109-151 (1 979). The nuc\e'\c acids of this invention could be generated using 

50 a conventional DNA synthesizing apparatus, such as the Applied Biosystems Model 380A or 380B DNA synthesizers 
(Applied Biosystems, Inc., 850 Lincoln Center Drive, Foster City, CA 94404) which employ phosphoramidite chemistry. 
Alternatively, phosphotrtester chemistry may be employed to synthesize the nucleic acids of this invention. [See, e.g., 
M.J. Gait, ed.. Oligonucleotide Synthesis, A Practical Approach. (1984) ] 

[0046] In an alternative methodology, namely PCR, the nucleic acids comprising a portion or all of SEQ ID NO:1 can 
55 be generated from S. cerevisiae genomic DNA using suitable oligonucleotide primers complementary to SEQ ID NO: 
1 or region therein, as described in U.S. Patent No. 4,889.818, which hereby is incorporated by reference. Suitable 
protocols for performing the PCR are disclosed in, for example, PCR Protocols: A Guide to Method and Applications, 
Ed. Michael A. Innis et al , Academic Press, Inc. (1990). 
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[0047] The ribonucleic acids of the present invention nnay be prepared using the polynucleotide synthetic nnethods 
discussed supra, or they nnay be prepared enzymatically using RNA pofymeraso-to transcribe a DNA template. 
[0048] The most preferred systems for preparing the ribonucleic acids of the present Invention employ the RNA 
polymerase from the bacteriophage T7 or the bacleriophage SP6. These RNA polymerases are highly specific, requir- 
ing the insertion of bacteriophage-specilic sequences at the 5' end of the tennplate to be transcribed. See. J. Sambrook, 
etat., supra, at 18.82-18.84. 

[0049] This invention also provides nucleic acids, RNA or DNA, which are complementary to the nucleic acids en- 
coding the ECB binding domain of SEQ ID NO:2. 

[0050] The present Invention also provides probes and primers useful for a variety of molecular biology techniques 
Including, for example, hybridization screens of genomic or subgenomic libraries. A nucleic acid compound comprising 
SEQ ID NO 1, or a complementary sequence thereof, or a fragment thereof, and which Is at least 18 base pairs In 
length, and which will selectively hybridize to Saccharomyces cerevisiae DN/K or mRNA encoding FKS1, Is provided. 
Preferably, the 18 or more base pair compound Is DNA. A probe or primer length of at least 18 base pairs is dictated 
by theoretical and practical considerations. See e.g. B. Wallace and G. Miyada. 

"Oligonucleotide Probes for the Screening of Recombinant DNA Libraries," In Methods in Enzvmotogy . Vol. 152, 
432-442, Academic Press (1987). 

[0051] These probes and primers can be prepared by enzymatic methods well known to those skilled in the art (See 
e.g. Sambrook et af. supra). In a most preferred embodiment these probes and primers are synthesized using chemical 
means as described above. 

[0052] Another aspect of the present invention relates to recombinant DNA cloning vectors and expresskxi vectors 
comprising the nucleic acids of the present Invention. Many of the vectors encompassed within this invention are 
described above. The preferred nucleic acid vectors are those which comprise DNA. The most preferred recombinant 
DNA vectors comprise nuciek; acid encoding the ECB binding domain of SEQ ID NO:2. 

[0053] The skilled artisan understands that choosing the most appropriate cloning vector or expression vector de- 
pends upon a number of factors including the availability of restrrction enzyme sites, the type of host celt into which 
the vector is to be transfected or transformed, the purpose of the transtectlon or transformation {e.g., stable transfor- 
mation as an extrachronrtosomal element, or integration Into the host chromosome), the presence or absence of readily 
assayable or selectable markers (e.g., antibratic resistance and metabolic markers of one type and another), and the 
number of copies of the gene to be present In the host cell. 

[0054] Vectors suitable to carry the nucleic acids of the present Invention comprise RNA viruses, DNA viruses, lytic 
bacteriophages, lysogenic bacteriophages, stable bacteriophages, plasmids, viroids, and the like. The most preferred 
vectors are plasmids. 

[0055] When preparing an expression vector the skilled artisan understands that there are nnany variables to be 
considered, for example, whether to use a constitutive or Inducible promoter Inducible promoters are preferred because 
they enable high level, regulatable expression of an operably finked gene. The skilled artisan will recognize a number 
of inducible promoters which respond to a variety of inducers, for example, carbon source, metal ions, heat, and others. 
The practitioner also understands that the amount of nucleic acid or protein to be produced dk;tates, in part, the selection 
of the expression system. The addition of certain nucleotide sequences Is useful for directing the localizaton of a 
recombinant protein. For example, a sequence encoding a signal peptkJe preceding the coding region of a gene, is 
useful for directing the extra-cellular export of a resulting polypeptide. 

[0056] The present Inventbn also provkJes a method for constructing a recombinant host cell capable of expressing 
the ECB binding domain of SEQ ID NO:2, said method comprising transforming or otherwise introducing into a host 
cell a recombinant DNA vector that comprises an Isolated DNA sequence encoding amino acid residues from about 
583 to 672 of SEQ ID NO:2. Suitable host cells include any strain of E colioi S. cerevisiae that can accommodate 
high level expression of an exogenously introduced gene. Transformed host cells may be cultured under conditions 
well known to skilled artisans such that the ECB binding domain is expressed, thereby producing ECB binding peptide 
in the recombinant host cell. 

[0057] Agents that bind the ECB binding domain may Identify new antifungal compounds. Substances that bind the 
ECB binding peptide can be identified by contacting the peptide with a test compound and monitoring the interaction 
by any suitable means. 

[0058] The instant invention provides a screening method for discovering compounds that bind the ECB binding 
peptide, said method comprising the steps of: 

a) preparing the binding peptide, preferably as a fusion protein; 

b) exposing said peptide or protein to a test compound; and 

c) quantifying the binding of said compound to said peptide by any suitable means. 
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[0059] In one embodiment, a protein comprising a fusion of the 89 amino acid residue ECB binding domain of SEQ 
ID NO:2 and a GST protein is expressed in yeast or E. coli, and purified for use in a microtitcr plate ELISA screen. The 
ELISA screen enables an assay for the displacement of ECB from the ECB binding domain by a test compound Bound 
ECB. or ECB free in solution can be detected using an ECB-specific antibody prepared using standard methods. If a 

5 test compound displaces ECB from the binding domain there will be a diminution in the ELISA signal. This method 
involves coating the wells of a microtiler plate with, for example, a GST-FKS1 fusion protein. After blocking residual 
binding sites the plates are rinsed to remove unbound fusion protein and then incubated with ECB. After rinsing again 
to remove unbound ECB, a test compound is added, incubated, and rinsed to remove unbound test compound or 
displaced ECB. The plates are then incubated with an antibody against ECB that is covalently linked to alkaline phos- 

10 phatasc (anti-ECB-AP). The plates are developed by adding an appropriate substrate, e.g. p-nitrophenyl phosphate 
for colorimetric detection, or 4-methylumbelltferyl phosphate for fluorimetric detection 

[0060] This screening method may be adapted to automated procedures such as a PANDEX® (Baxter-Dade Diag- 
nostics) system, allowing for efficient high-volume screening of potential therapeutic agents. 

[0061] In such a screening protocol an ECB binding peptide is prepared as described herein, preferably using re- 
15 combinant DNA technology. A test compound is introduced into the reaction vessel containing the peptide. 

[0062] Skilled artisans will recognize that IC50 values are dependent on the selectivity of the compound tested. For 
example, a compound with an (C50 which is less than 10 nM is generally considered an excellent candidate for drug 
therapy. However, a compound whk;h has a lower affinity, but is selective for a particular target, may be an even better 
candidate. The skilled artisan will recognize that any information regarding inhibitory activity or selectivity of a particular 
20 compound is beneficial in the pharmaceutical arts. 

[0063] The following examples more fully describe the present invention. Those skilled in the art will recognize that 
the particular reagents, equipment, and procedures described are nnerely illustrative and are not intended to limit the 
present inventbn in any noanner. 

25 EXAMPLE 1 

Expression Vector Encoding the ECB Binding Domain 

[0064] A vector for expressing a f uskxi protein in yeast comprising the ECB binding domain of yeast glucan synthase 

30 and glutathione S-transferase (GST) is prepared as follows. Plasmid pGEX-1 (Smith and Johnson, Gene. 67, 31-40. 
1988) is an £ co// expression vector that comprises the tecpronrwter and the complete coding sequence of Sj26 (viz. 
GST), in which the normal termination codon is replaced by a polylinker containing unique BamH 1 , Snoal . and EcoRI 
restriction sites, followed by a termination codon in all 3 reading frames. A fragment of pGEX-1 containing the described 
GST gene is isolated by any suitable subcloning method, well known to the skilled artisan. It is convenient, but not 

35 necessary, for subsequent cloning steps, to attach to thefragment containing the GST gene of pGEX-1 oligonucleotkJes 
containing specific restriction enzyme sites For convenience, the GST fragment thus described is cloned into the 
multiple cbning site of yeast expression vector pREPI (K. Maundrell, J. Bk>l Chem. 265. 10857, 1990). in the correct 
orientation, downstream of the LEU2 gene, and nmt\ promoter pREP1 also contains an ARS element for replication 
in the host yeast. The resulting plasmid, pREPI-GST. is linearized at any one or more of BamHI, Smal, or EcoRI 

40 sites at the 3' end of the GST fragment, for ctoning in the ECB binding domain. 

[0065] A DNA fragment encoding the ECB binding domain of SEQ ID NO:2 is conveniently prepared by PCR. Oli- 
gonucleotide primers are prepared for priming DNA synthesis on opposite strands from nucleotide positions 1747 
through 2016 of SEQ ID NO:1 . It is convenient to include suitable restriction sites at the appropriate 5* or 3' end of the 
PCR primers for subsequent cloning. The ECB binding fragment so prepared is purified by any suitable method, for 

45 example, isolatran by gel electrophoresis. The purified ECB binding fragment is ligated into pREPI-GST so that the 
ECB binding fragment is linked to the 3' end of the GST gene. This construct. pREPI-GST-ECB. produces a fusron 
protein comprising a GST-ECB binding domain. 

EXAMPLE 2 

so 

E. coli Expression Vector Encoding the ECB Binding Domain 

[0066] A vector for expressing a fusion protein in E. co// comprising the ECB binding domain of yeast glucan synthase 
and glutathione S-transferase (GST) is prepared as follows. Plasmid pGEX-1 (Smith and Johnson, Gene. 67. 31-40. 
55 1 988) is an £ coti expression vector that comprises the tac promoter and the complete coding sequence of Sj26 (viz. 
GST). In which the normal termination codon is replaced by a polylinker containing unique BamHI , Smal . and EcoRI 
restriction sites, followed by a termination codon in all 3 reading frames. 

[0067] A DNA fragment encoding the ECB binding domain of SEQ ID NO:2 is conveniently prepared by PCR. Oli- 
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[0059] In one embodiment, a protein comprising a fusion of the 89 amino acid residue ECB binding domain of SEQ 
ID NO: 2 and a GST protein Is expressed in yeast or £. coli, and purified for use in a microtiter plate ELISA screen. The 
ELISA screen enables an assay for the displacement of ECB from the ECB binding domain by a test compound. Bound 
ECB. or ECB free in solution can be detected using an ECB-specific antibody prepared using standard methods. If a 

5 test compound displaces ECB from the binding domain there will be a diminution in the ELISA signal. This method 
involves coating the wells of a microtiter plate with, for example, a GST-FKSl fusion protein. After blocking residual 
binding sites the plates are rinsed to remove unbound fusion protein and then incubated with ECB. After rinsing again 
to remove unbound ECB, a test compound is added, incubated, and rinsed to remove unbound lest compound or 
displaced ECB. The plates are then incubated with an antibody against ECB that is covalentty linked to alkaline phos- 

^0 phatase (anti-ECB-AP). The plates are developed by adding an appropriate substrate, e.g. p-nitrophenyl phosphate 
for colorimetric detection, or 4-methylumbelliferyl phosphate for fluorinnetric detection. 

[0060] This screening method may be adapted to automated procedures such as a PANDEX® (Baxter-Dade Diag- 
nostics) system, allowing for efficient high-volume screening of potential therapeutic agents. 

[0061] In such a screening protocol an ECB binding peptide is prepared as described herein, preferably using re- 
'5 combinant DNA technology. A test compound is Introduced into the reaction vessel containing the peptide. 

[0062] Skilled artisans will recognize that IC50 values are dependent on the selectivity of the compound tested. For 
example, a compound with an IC50 which is less than 10 nM is generally considered an excellent candidate for drug 
therapy. However, a compound whk:h has a lower affinity, but is selective for a particular target, may be an even better 
candidate. The skilled artisan will recognize that any information regarding inhibitory activity or selectivity of a particular 
20 compound is beneficial in the pharmaceutical arts. 

[0063] The following examples nrrare fully describe the present invention. Those skilled in the art will recognize that 
the particular reagents, equipment, and procedures described are merely illustrative and are not intended to limit the 
present inventbn in any manner. 

25 EXAf^PLE 1 

Expression Vector Encoding the ECB Binding Domain 

[0064] A vector for expressing a fusion protein in yeast comprising the ECB binding domain of yeast glucan synthase 

30 and glutathione S-transf erase (GST) is prepared as follows. Plasmid pGEX-1 (Smith and Johnson, Gene, 67, 31-40, 
1988) is an E. co// expression vector that comprises the tac pronDoter and the complete coding sequence of Sj26 (viz. 
GST), in which the normal termination codon is replaced by a polylinker containing unique BamH1 , Sma^ , and EcoRI 
restriction sites, followed by a termination codon in all 3 reading frames. A fragment of pGEX-1 containing the described 
GST gene is isolated by any suitable subcloning method, well known to the skilled artisan. It is convenient, but not 

3S necessary, for subsequent cloning steps, to attach to the fragment containing the GST gene of pGEX-1 oligonucleotides 
containing specific restriction enzyme sites. For convenience, the GST fragment thus described is cloned into the 
multiple cloning site of yeast expression vector pREPI (K. Maundrell, J. Biol. Chem. 265, 10857, 1990), in the correct 
orientation, downstream of the LEU2 gene, and nmr\ promoter. pREPI also contains an ARS element for replication 
in the host yeast. The resulting plasmid, pREPI-GST, is linearized at any one or more of BamHI, Smal, or EcoRI 

40 sites at the 3' end of the GST fragment, for ck>ning in the ECB binding domain. 

[0065] A DNA fragment encoding the ECB binding domain of SEQ ID N0:2 is conveniently prepared by PCR. Oli- 
gonucleotide primers are prepared for priming DNA synthesis on opposite strands from nucleotide positions 1747 
through 2016 of SEQ ID NO:1 . It is convenient to Include suitable restriction sites at the appropriate 5' or 3' end of the 
PCR primers for subsequent cloning. The ECB binding fragment so prepared is purified by any suitable method, for 

45 example, isolation by gel electrophoresis. The purified ECB binding fragment is ligated into pREP1-GST so that the 
ECB binding fragment is linked to the 3* end of the GST gene. This construct, pREP1-GST-ECB, produces a fusion 
protein comprising a GST-ECB binding domain. 

EXAMPLE 2 

so 

E. CO// Expression Vector Encoding the ECB Binding Domain 

[0066] A vector for expressing a fusion protein inE. co//comprisingthe ECB binding domain of yeast glucan synthase 
and glutathione S-transferase (GST) is prepared as follows. Plasmid pGEX-1 (Smith and Johnson, Gene, 67, 31-40, 
55 1 988) is an £ coli expression vector that comprises the tac promoter and the complete coding sequence of Sj26 (viz. 
GST). In which the normal terminatkxi codon is replaced by a polylinker containing unique BamHI , Smal , and EcoRI 
restriction sites, followed by a termination codon in all 3 reading frames. 

[0067] A DNA fragment encoding the ECB binding domain of SEQ ID N0:2 is conveniently prepared by PCR. Oli- 
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gonucleotide primers are prepared for priming DNA synthesis on opposite strands, from nucleotide positions 1747 
through 2016 of SEQ ID NO:1. It is convenient to design into the oligonucleotide sequence suitable restriction sites at 
the termini for subsequent cloning steps. The ECB binding fragment so prepared is purified by any suitable method, 
for example, isolation from a gel following electrophoresis. The purified ECB binding fragment is ligated into pGEX-1 
5 so that the ECB binding fragment is linked to the 3' end of the GST gene. This construct. pGST-ECB, produces a fusion 
protein comprising a GST-ECB binding domain. 

EXAMPLE 3 

10 Expression of ECB Fusion Protein in S. pombe 

[0068] Expression plasmid pREP1 -GST-ECB (Example 1) is transformed into any suitable strain of S. pombe, for 
example, a leul strain (See e.g. R. Sikorski & P Hieter, Genetics, 122, 19-26, 1989; K. Maundrell, J. Biol. Chem. 265, 
10857, 1990) using standard methods, for example, spheroplast transformation, or lithium acetate transformation {See 

IS e.g. Sambrook etal. Sup/a; Okazaki etal. Nuc. Acid Res 18. 6485-89 (1990); Moreno eta/. Meth.Enzym. 194, 795-823 
(1 991). Transformants, chosen at random, are tested for the presence of the plasmid by agarose gel electrophoresis 
using quick plasmid preparations. Id. Transformants are grown overnight under conditrans suitable to induce the nmt\ 
promoter, for example, in minimal medium lacking thiamine (Beach & Nurse. Nature, 290, 140, 1981). The overnight 
culture was diluted into fresh medium and allowed to grow to mid-log phase. The induced-culture was pelleted by 

20 centrif ugation in preparation for protein purification. 

EXAMPLE 4 

Affinity Purification of a Recombinantly-Produced ECB Binding Domain 

25 

[0069] Overnight cultures of transformed E. colior yeast cells. (See e.g. Example 3), are lysed by sonication with 
glass beads, or by spheroplast formation in MTPBS (150 mM NaCI, 1 6 mM Na2HP04, 4 mM NaH2P04 (pH 7.3) and 
including 1% Triton X-100 (BDH Chemicals). Lysed cells are subjected to centrrfugation at 10,000 x g for 5 minutes at 
4° C. The supernatant is mixed on a rotating platform with 1 to 2 ml 50% glutathione-agarose beads (sulphur linkage, 
30 Sigma). After absorption for 2 minutes, beads are collected by brief centrif ugatk>n at 500 x g and washed 3 times with 
50 ml MTPBS. Fusion protein is eluted by competition with free glutathione, using 2x2 minute washes with 1 bead 
volume of 50 mM Tris HCI, pH 8, containing 5 mM reduced glutathione (Sigma), pH 7.5. 
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SFQUKNCF LI ST IMG 



(1) GENERAL INFORMATION: 



(i) APPLICATJT; ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(E) COUNTRY; United States of America 
(Fi ZIP: 46285 

(ii) TITLE OF I^A/ENTICN: Echinocandin Binding 3i::e of 
i,3-B-Glucan Synthase 

(iii) NUMBER OF SEQUENCES: 2 

20 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: A. M. Denholm 
(Bl STREET: Erl Wood Manor 

(C) CITY: Windlesham 

(D) STATE: Surrey 

{E) COUNTRY: United Kingdom 
(F) ZIP: GU20 6PH 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-IX)S/MS-CKDS 

(D) SOFTWARE: Patencin Release #1.0, Version #1.30 

(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5631 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
55 O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A J NAME /KEY: CDS 

(B) LOCATION: 1..5628 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



ATG AAC ACT CAT CAA CAA CCT TAT CAG GGC CAA ACG GAG TAT ACC CAG 48 
Met Asn Thr Asp Gin Gin Pro Tyr Gin Gly Gin Thr Asp Tyr Thr Gin 
15 10 15 

GGA CCA GGT AAC GGG CAA AGT CAG GAA CAA GAG TAT GAC CAA TAT GGC 9 6 

Gly Pro Gly Asn Gly Gin Sor Gin Glu Gin Asp Tyr Asp Gin Tyr Gly 
20 25 30 

CAG CCT TTG TAT CCT TCA C/iA GCT GAT GGT TAG TAC GAT CCA AAT GTC 14 4 

Gin Pro Lo'j Tyr Pro Ser Gin Ala Asp Gly Tyr Tyr Asp Pro Asn Val 
55 35 40 45 
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GCT GCT GGT ACT GAA GCT GAT ATG TAT GGT CAA CAA CCA CCA A/^C GAG ;92 

Aid Aid Gly Thr Glu Ala Asp Met Tyr GIv Gin Gin Pro Pro Asn Glu 

50 55 6C 

5 

TCT TAC GAG CAA GAC TAG ACA AAC GGT GAA TAC TAT GGT CAA CCG CCA 2 40 

Ser Tyr Asp Gla Asp Tyr Thr Asn Gly Glu Tyr Tyr Gly Gin Pro Pro 

65 70 75 fiO 

AAT ATG GCT GCT CAA GAC GGT GAA AAC TTC TCG GAT TTT AGC AGT TAC 2 38 

Asn Met Ala Ala Gin Asp Gly Glu Asn Phc Scr Asp Phe Ser Ser Tyr 

10 85 90 95 

GGC CCT CCT GGA ACA CCT GGA TAT GAT AGC TAT GGT GGT CAG TAT ACC 116 

Gly Pro Pro Gly Thr Pro Gly Tyr Asp Ser Tyr Gly Gly Gin Tyr Thr 

100 105 UO 



15 



20 



GCT TCT CAA ATG AGT TAT GGA GAA CCA AAT TCG TCG GGT ACC TCG ACT ^34 

Ala Ser Gin Mec Ser Tyr Gly Glu Pro Asn Ser Ser Gly Thr Ser Thr 
115 120 125 

CCA ATT TAC GGT AAT TAT GAC CCA AAT GCT ATC GCT ATG GCT TTG CCA 4 32 

Pro lie Tyr Gly Asn Tyr Asp Pro Asn Ala lie Ala Met Ala Leu Pro 
130 135 140 

AAT GAA CCT TAT CCC GCT TGG ACT GCT GAC TCT CAA TCT CCC GTT TCG 480 

Asn Glu Pro Tyr Pro Ala Trp Thr Ala Asp Ser Gin Ser Pro Val Ser 
145 150 155 160 | 

ATC GAG CAA ATC GAA GAT ATC TTT ATT GAT TTG ACC AAC AGA CTC GGG 52j0 

lie Glu Gin lie Glu Asp lie Phe He Asp Leu Thr Asn Arg Leu Gly 

2S 165 170 175 

TTC CAA AGA GAC TCC ATG AGA AAT ATG TTT GAT CAT TTT ATG GTT CTC 576 

Phe Gin Arg Asp Ser Met Arg Asn Met Phe Asp His Phe Met Val Leu 

180 185 190 



30 



35 



TTG GAC TCT AGG TCC TCG AGA ATG TCT CCT GAT CAA GCT TTA CTA TCT 6 24 

Leu Asp Ser Arg Ser Ser Arg Met Ser Pro Asp Gin AIa Leu Leu Ser 

195 200 205 

TTA CAT GCC GAC TAC ATT GGT GGC GAT ACT GCT AAC TAT AAA AAA TGG 6 72 

Leu His Ala Asp Tyr lie Gly Gly Asp Thr Ala Asn Tyr Lys Lys Trp 

210 215 220 

TAT TTT GCT GCT CAG TTA GAT ATG GAT GAT GAA ATT GGT TTT AGA AAT 7 2C 

Tyr Phe Ala Ala Gin Leu Asp Met Asp Asp Glu lie Gly Phe Arq Asn 

225 230 235 240 

ATG AGT CTT GGA AAA CTC TCA AGG AAG GCA AGA AAA GCT AAG AAG AAA 7 68 

Met Ser Leu Gly Lys Leu Ser Arg Lys Ala Arg Lys Ala Lys Lys Lys 

"^0 245 250 255 

AAC AAG AAA GCA ATG GAA GAG GCC AAT CCC GAA GAC ACT GAA GAA ACT 816 

Asn Lys Lys Ala Met Glu Glu Ala Asn Pro Glu Asp Thr Glu Glu Thr 

260 265 270 



45 



SO 



TTA AAC AAA ATT GAA GGC GAC AAC TCC CTA GAG GCT GCT GAT TTT AGA 864 

Leu Asn Lys He Glu Gly Asp Asn Ser Leu Glu Ala Ala Asp Phe Arg 

275 230 285 

TGG AAG GCC AAG ATG AAC CAG TTG TCT CCC CTG GAA AGA GTT CGT CAT 912 

Trp Lys Ala Lys Met Asn Gin Leu Ser Pro Leu Glu Arg Val Arg His 

290 295 300 
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ATC GCC TTA TAT CTG TTA TGT TGG GGT GAA GCT AAT CAA GTC AG A TTC 960 
lie Ala Leu Tyr Leu Leu Cys Txp Gly Glu Ala Asn Gin Val Arq ?he 
3C5 310 315 320 

ACT GCT GAA TGT TTA TGT TTT ATC TAC AAG TGT GCT CTT GAC TAC TTG 1008 
Thr Ala Glu Cys Leu Cys Phe lie Tyr Lys Cys Ala Leu Asp Tyr Leu 
325 330 335 

GAT TCC CCT CTT TGC CAA CAA CGC CAA GAA CCT ATG CCA GAA GGT GAT :056 
Asp Ser Pro Leu Cys Gin Gin Arg Gin Glu Pro Met Pro Glu Gly Asp 
340 345 350 

TTC TTG AAT AGA GTC ATT ACG CCA ATT TAT CAT TTC ATC AGA AAT CAA 1104 
Phe Leu Asn Arg Val lie Thr Pro lie Tyr His Phe lie Arg Asn Gin 
3bb 360 36S 

GTT TAT GAA ATT GTT GAT GGT CGT TTT GTC AAG CGT GAA AGA GAT CAT 1152 
Val Tyr Glu lie Val Asp Gly Arg Phe Val Lys Arg Glu Arg Asp His 
370 375 380 

AAC AAA ATT GTC GGT TAT GAT GAT TTA AAC CAA TTG TTC TGG TAT CCA 1200 
Asn Lys lie Val Gly Tyr Asp Asp Leu Asn Gin Leu Phe Trp Tyr Pro 
385 390 395 400 

GAA GGT ATT GCA AAG ATT GTT CTT GAA GAT GGA AC A AAA TTG ATA GAA 1248 
Glu Gly lie Ala Lys lie Val Leu Glu Asp Gly Thr Lys Leu He Glu 

405 410 415 I 

CTC CCA TTG GAA GAA CGT TAT TTA AGA TTA GGC GAT GTC GTC TGG GAT 129^ 
r,eu Pro Leu Glu Glu Arg Tyr Leu Arg Leu Gly Asp Val Val Trp Asp 
25 420 425 430 

GAT GTA TTC TTC AAA ACA TAT AAA GAG ACC CGT ACT TGG TTA CAT TTG 1J44 
Asp Val Phe Phe Lys Thr Tyr Lys Glu Thr Arg Thr Trp Leu His Leu 
435 440 445 



15 



20 



30 



35 



GTC ACC AAC TTC AAC CGT ATT TGG GTT ATG CAT ATC TCC ATT TTT TGG 13 92 

Val Thr Asn Phe Asn Arg lie Trp Val Met His He Ser He Phe Trp 
450 455 460 

ATG TAC TTT GCA TAT AAT TCA CCA ACA TTT TAC ACT CAT AAC TAT CAA 1440 
Met: Tyr Phe Ala Tyr Asn Ser Pro Thr Phe Tyr Thr His Asn Tyr Gin 
465 470 475 480 

CAA TTG GTC GAC AAC CAA CCT TTG GCT GCT TAC AAG TGG GCA TCT TGC 1488 
Gin Let! Val Asp Asn Gin Pro Leu Ala Ala Tyr Lys Trp Ala Ser Cys 
485 490 495 

GCA TTA GGT GGT ACT GTC GCA AGT TTG ATT CAA ATT GTC GCT ACT TTG 15 36 

Ala Leu Gly Gly Thr Val Ala Ser Leu He Gin He Val Ala Thr Leu 
40 500 505 510 

TGT GAA TGG TCA TTC GTT CCA AGA AAA TGG GCT GGT GCT CAA CAT CTA 1584 
Cys Glu Trp Ser Phe Val Pro Arg Lys Trp Ala Gly Aia Gin His Leu 
515 520 52S 



45 



50 



TCT CGT AGA TTC TGG TTT TTA TGC ATC ATC TTT GGT ATT AAT TTG GGT 16 32 

Ser Arg Arg Phe Trp Phe Leu Cys lie He Phe Gly He Asn Leu Gly 
530 535 540 

CCT ATT ATT TTT GTT TTT GCT TAC GAC AAA GAT ACA GTC TAC TCC ACT 1680 

Pro He He Phe Val Phe Ala Tyr Asp Lys Asp Thr Val Tyr Ser Thr 

545 550 555 560 

GCT GCA CAC GTT GTT GCT GCT GTT ATG TTC TTT GTT GCG GTT GCT ACC 1728 
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Ala Aid His Vdl Val Ala Ala Va 1 Met: ?he Phe Va: Ala Vai Ala Thr 
^6*^ S70 S 75 

ATC ATA TTC TTC TCC ATT ATG CCA TTG GGG GGG TTG TTT AGO TCA TAT 17 7 6 

lie lie ?he Phe Set lie Met Pro Lou Gly Gly Leu Phe Thr Ser Tyr 
58G S8S 590 

ATG AAA AAA TCT ACA AGG CGT TAT GTT CCA TCT CAA ACA TTC ACT GCT 1^24 

Met Lys I//S Scr Thr Arg Arg Tyr Vdl Ala Ser Gin Thr Phe Thr Ala 

595 600 605 

GCA TTT GCC CCT CTA CAT GGG TTA GAT AGA TGG ATG TCC TAT TTA GTT 157 2 

Ala Phe Ala Pro Leu His Gly Leu Asp Arg Trp Met Ser Tyr Leu Val 

610 615 620 

TGG GTT ACT GTT TTT GCT GCC AAA TAT TCA G AA TCG TAC TAG TTT TTA 19 2 0 

Trp Vol Thr Val Phe Ala Ala Lys Tyr Ser Glu Ser Tyr Tyr Phe Leu 

625 630 635 €40 

GTT TTA TCT TTG AGA GAT CCA ATT AGA ATT TTG TCC ACC ACT GCA ATG 1963 

Val Lew Ser T.cu Arg Asp Pro Tie Arg He Leu Ser Thr Thr Ala Met 
645 650 655 

20 AGG TGT ACA GGT GAA TAC TGG TGG GGT GCG GTA CTT TGT AAA GTG CAA 2016 

Arg Cys Thr Gly Glu Tyr Trp Trp Gly Ala Val Leu Cys Lys Val Gin 
660 665 670 

CCC AAG ATT GTC TTA GGT TTG GTT ATC GCT ACC GAC TTC ATT CTT TTC 20 5*4 

Pro Lys He Val Leu Gly Leu Val He Ala Thr Asp Phe He Lei: Phe ' 

675 680 685 

TTC TTG GAT ACC TAC TTA TGG TAC ATT ATT GTG AAT ACC ATT TTC TCT 2112 

Phe Leu Asp Thr Tyr Leu Trp Tyr He He Val Asn Thr He Phe Ser 

690 595 700 

GTT GGG AAA TCT TTC TAT TTA GGT ATT TCT ATC TTA ACA CCA TGG AGA 2150 

Val Gly Lys Ser Phe Tyr Leu Gly He Ser He Leu Thr Pro Trp Arg 

705 710 715 720 

AAT ATC TTC ACA AGA TTG CCA AAA AGA ATA TAC TCC AAG ATT TTG GCT 2 2 08 

Asn He Phe Thr Arg Lou Pro Lys Arg He Tyr Ser Lys He Leu Ala 
725 730 735 

35 ACT ACT GAT ATG GAA ATT AAA TAC AAA CCA AAG GTT TTG ATT TCT CAA 22 5 6 

Thr Thr Asp Met Glu He Lys Tyr Lys Pro Lys Val Leu Tie Ser Gin 
740 745 750 

GTA TGG AAT GCC ATC ATT ATT TCA ATG TAC AGA GAA CAT CTC TTA GCC 2 3 04 

Val Trp Asn Ala He He He Ser Met Tyr Arg Glu His Leu Leu Ala 

755 760 765 

40 

ATC GAC CAT GTA CAA AAA TTA CTA TAT CAT CAA GTT CCA TCT GAA ATC 2352 

He Asp His Val Gin Lys Leu Leu Tyr His Gin Val Pro Ser Glu He 

770 775 780 

GAA GGT AAA AGA ACT TTG AGA GCT CCT ACC TTC TTT GTT TCT CAA GAT 2 4 00 

45 Glu Gly Lys Arg Thr Leu Arg Ala Pro Thr Phe Phe Val Ser Gin Asp 

785 790 795 SOO 

GAC AAT AAT TTT GAG ACT GAA TTT TTC CCT AGG GAT TCA GAG GCT GAG 2 44 8 

Asp Asn Asn Phe Glu Thr Glu Phe Phe Pro Arg Asp Ser Glu Ala Glu 
805 810 815 

SO CGT CGT ATT TCT TTC TTT GCT CAA TCT TTG TCT ACT CCA ATT CCC GAA 249 6 

Arg Arg He Ser Phe Phe Ala Gin Ser Leu Ser Thr Pro He Pro Glu 
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820 825 330 

CCA CTT CCA GTT CAT AAC ATG CCA ACG TTC ACA GTA TTG ACT OCT CAC 25 4 4 

Pro Leu Pro Val Asp Asn Met Pro Thr Phe Thr Val Leu Thr Pro His 

835 R40 845 

TAC GCG GAA AGA ATT CTG CTG TCA TTA AGA CAA ATT ATT CGT GAA GAT 2'j92 

Tyr Ala Glu Arg lie Leu Leu Sec Leu Arg Glu lie He Arq Glu Asp 

850 855 860 

GAC CAA TTT TCT AGA GTT ACT CTT TTA GAA TAT CTA AAA CAA TTA CAT 

Asp Gin Phe Ser Arg Val Thr Leu Leu Glu Tyr Leu Lys Gin Leu His 

ft65 87C 875 fl8C 

CCC GTT GAA TGG GAA TGT TTT GTT AAG GAT ACT AAG ATT TTG GCT CAA 

Fro Val Glu Trp Glu Cys Phe Val Lys Asp Thr Lys lie Leu Ala GLu 
885 890 895 

G.AA ACC GCT GCC TAT GAA GGA AAT GAA AAT GAA GCT GAA AAG GAA GAT 21 ^^ 

Glu Thr Ala Ala Tyr Glu Gly Asn Glu Asn Glu Ala Glu Lys Glu Asp 

900 90S 910 

GCT TTG AAA TCT CAA ATC GAT GAT TTG CCA TTT TAT TGT ATT GGT TTT 2 731 

Ala Leu Lys Ser Gin lie Asp Asp Leu Pro Phe Tyr Cys He Gly Phe 

20 915 920 925 

AAA TCT GCT GCT CCA GAA TAT ACA CTT CGT ACG AGA ATT TGG GCT TCT 28 3 2 

t.ys Ser Ala Ala Pro Glu Tyr Thr Leu Arg Thr Arg He Trp Ala Ser • 

930 935 940 



10 



15 



25 



30 



TTG AGG TCG CAG ACT CTA TAT CGT ACC ATT TCA GGG TTC ATG AAT TAT 

r.p\i Arg Ser Gin Thr Leu Tyr Arg. Thr lie Ser Gly Phe Met .Asn Tyr 

945 950 955 960 

TCA AGA GCT ATC AAA TTA CTG TAT CGT GTG GAA AAT CCT GAA ATT GTT 2 92R 

Ser Arg Ala lie Lys [.eu Leu Tyr Arg Val Glu Asn Pro Glu He Val 

965 970 975 

CAA ATG TTT GGT GGT AAT GCT GAA GGC TTA GAA AGA GAG CTA GAA AAG 2 9 76 

Gin Met Phe Gly Gly Asn Ala Glu Gly Leu Glu Arg Glu Leu Glu Lys 

980 985 990 

ATG GCA AGA AGA AAG TTT AAA TTT TTG GTC TCT ATG CAG AGA TTG GCT 3 02 4 

Met Ala Arg Arg Lys Phe Lys Phe Leu Val Ser Met Gin Arg Leu Ala 

35 995 1000 1005 

AAA TTC AAA CCA CAT GAA CTG GAA AAT GCT GAG TTT TTG TTG AGA GCT ^ : :2 

Lyi? Phe Lys Pro His Glu Leu Glu Asn Ala Glu Phe Leu Leu Arg Ala 
1010 1015 102O 



40 



45 



TAC CCA GAC TTA CAA ATT GCC TAC TTG GAT GAA GAG CCA CCT TTG ACT : ) 

Tyr Pro Asp Leu Gin He Ala Tyr Leu Asp Glu Glu Pro Pro Lou Thr 

102b 1030 1035 1040 

GAA GGT GAG GAG CCA AGA ATC TAT TCC GCT TTG ATT GAT GGA CAT TGT • > 

Glu Gly Glu Glu Pro Arg He Tyr Ser Ala Leu lie Asp Gly His Cy^3 

1045 1050 1055 

GAA ATT CTA GAT AAT GGT CGT AGA CGT CCC AAG TTT AGA GTT CAA TTA . -S 

Glu He Leu Asp Asn Gly Arg Arg Arg Pro Lys Phe Arg Val Gin Leu 

1060 1065 1070 

TCT GGT AAC CCA ATT CTT GGT GAC GGT AAA TCT GAT AAC CAA AAC CAT 4 

Ser Gly Asn Pro He Leu Gly Asp Gly Lys Ser Asp Asn Gin Asn His 

50 1075 1080 1085 
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20 



OCT TTG ATT TTT TAC AGA GGT GAA TAG ATT CAA TTA AT? GAT GCC AAC .3312 
Aia Leu lie Phe Tyr Arg Gly Glu Tyr lie Gin Leu lie Asp Ala Asn 
109C 1095 1100 

CAA GAT AAC TAC TTG GAA GAA TGT CTG AAG ATT AGA TC7 GTA TTG GOT 
Gin Asp A^n Tyr Leu Glu Glu Cys Leu Lya Tic Arg Ser Val Leu Ala 
1105 li:C 1115 11^0 

GAA T?T GAG GAA TTG AAC GTT GAA CAA GTT AAT CCA TAT GCT CCC GGT 3 4 OR 

Glu Phe Glu Glu Leu Ayn Val Glu Gin Val Asn Pro Tyr Ala Pro Gly 
112S 1130 113b 

TTA AGG TAT GAG GAG CAA ACA ACT AAT CAT CCT GTT GCT ATT GTT GGT Hbo 
Leu Arg Tyr Glu Glu Gin Thr Thr Asn His Pro Val Ala lie Val Gly 
1140 1145 1150 

GCC ACA GAA TAC ATT TTC TCT GAA AAC TCT GGT GTG CTG GGT GAT GTG 3 504 

Ala Arg Glu Tyr lie Phe Ser Glu Asn Ser Gly Val Leu Gly Asp Val 
1155 1160 1165 

GCC GCT GGT AAA GAA CAA ACT TTT GGT ACA TTA TTT GCG CGT ACT TTA 3 552 

Ala Ala Gly Lys Glu Gin Thr Phe Gly Thr Leu Phe Ala Arg Thr Leu 
1170 1175 118C 

TCT CAA ATT GGT GGT AAA TTG CAT TAT GGT CAT CCG GAT TTC ATT AAT 3 6 0C 

Ser Gin lie Gly Gly Lys Leu His Tyr Gly His Pro Asp Phe lie Asn 
11S5 1190 1195 1200 ' 

r 

GCT ACG TTT ATG ACC ACT AGA GGT GGT GTT TCC AAA GCA CAA AAG GGT 364H 
Ala Thr Phe Met Thr Thr Arg Gly Gly Val Ser Lys Ala Gin Lys Gly 
1205 1210 1215 

TTG CAT TTA AAC GAA GAT ATT TAT GCT GGT ATG AAT GCT ATG CTT CGT 3696 
Leu His Leu Asn Glu Asp He Tyr Ala Gly Met Asn Ala Met Leu Arq 
1220 1225 1230 

GGT GGT CGT ATC AAG CAT TGT GAG TAT TAT CAA TGT GGT AAA GOT AGA 37 44 

Gly Gly Arg lie Lys His Cys Glu Tyr Tyr Gin Cys Gly Lys Gly Arg 
1235 1240 1245 

GAT TTG GGT TTC GGT ACA ATT CTA AAT TTC ACT ACT AAG ATT GGT GCT 3 792 

Asp Leu Gly Phe Gly Thr lie Leu Asn Phe Thr Thr Lys He Gly Ala 
1250 1255 1260 

GGT ATG GGT GAA CAA ATG TTA TCT CGT GAA TAT TAT TAT CTG GGT ACC 384 0 

Gly Met Gly Glu Gin Mec Lou Ser Arg Glu Tyr Tyr Tyr Leu Gly Thr 
1265 1270 1275 1280 

CAA TTA CCA GTG GAC CGT TTC CTA ACA TTC TAT TAT GCC CAT CCT GGT 38fl3 
Gin Leu Pro Val Asp Arg Phe Leu Thr Phe Tyr Tyr Ala His Pro Gly 
1285 1290 1295 

TTC CAT TTG AAC AAC TTG TTC ATT CAA TTA TCT TTG CAA ATG TTT ATG 39 3 6 

Phe His Leu Asn Asn Leu Phe He Gin Leu Ser Leu Gin Met Phe Met 
1300 1305 1310 



30 



35 



45 



SO 



TTG ACT TTG GTG AAT TTA TCT TCC TTC GCC CAT GAA TCT ATT ATG TGT 39^4 
Leu Thr Leu Val Asn Leu Ser Ser Leu Ala His Glu Ser He Met Cys 
1315 1320 1325 

ATT TAC GAT AGG AAC AAA CCA AAA ACA GAT GTT TTG GTT CCA ATT GGG 4D32 
He Tyr Asp Arg Asn Lys Pro Lys Thr Asp Val Leu Val Pro He Gly 
1330 1335 1340 
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TGT TAC AAC TTC CAA CCT GCG GTT GAT TGG GTG AGA CGT TAT ACA TTG 

Cys Tyr Asn Phe Gin Pro Ala Val Asp Trp Val Arg Arg Tyr Thr Lgu 

1345 1350 13S5 1360 

7CT ATT TTC ATT GT7 TTC TGG ATT GCC T7C GTT CCT ATT GTT GTT CAA 

3er lie Phe lie Val Phe Trp He Ala Phe Val Pro He Val Val Gin 
1365 1370 1375 

GAA CTA ATT GAA CGT GGT CTA TGG AAA GCC ACC CAA AGA TTT TTC TGC 

Glu Leu rie Glu Arg Gly Leu Trp Lys Ala Thr Gin Arg Phe Phe Cys 
1380 1385 1390 

CAC CTA TTA TCA TTA TCC CCT ATG TTC GAA GTG TTT GCG GGC CAA ATC 

His Leu Leu Ser Leu Ser Pro Met Phe Glu Val Phe Ala Gly Gin He 
1395 1400 1405 

TAC TCT TCT GCG TTA TTA AGT GAT TTA GCA ATT GGT GGT GCT CGT TAT 

Tyr Ser Ser Ala Leu Leu Ser Asp Leu Ala lie Gly Gly Ala Arg Tyr 
141D 1415 1420 

ATA TCC ACC GGT CGT GGT TTT GCA ACT TCT CGT ATA CCA TTT TCA ATT 

lie Ser Thr Gly Arg Gly Phe Ala Thr Ser Arg He Pro Phe Ser He 

1425 1430 1435 1440 

TTG TAT TCA AGA TTT GCA GGA TCT GCT ATC TAC ATG GGT GCA AGA TCA 

Leu Tyr Ser Arg Phe Ala Gly Ser Ala He Tyr Met Gly Ala Arg Ser 
1445 1450 1455 

ATG TTA ATG TTG CTG TTC GGT ACT GTG GCA CAT TGG CAA GCT CCA CTA 

Mec Leu Met Leu Leu Phe Gly Thr Val Ala His Trp Gin Ala Pro Leu 
1460 1465 1470 

CTG TGG TTT TGG GCC TCT CTA TCT TCA TTA ATT TTT GCG CCT TTC GTT 

Leu Trp Phe Trp Ala Ser Leu Ser Ser Leu He Phe Ala Pro Phe Val 
1475 1480 1485 

TTC AAT CCA CAT CAG TTT GCT TGG GAA GAT TTC TTT TTG GAT TAC AGG 

Phe Asn Pro His Gin Phe Ala Trp Glu Asp Phe Phe Leu Asp Tyr Arg 
1490 1495 1500 

GAT TAT ATC AGA TGG TTA TCA AGA GGT AAT AAT CAA TAT CAT AGA AAC 

Asp Tyr He Arg Trp Leu Ser Arg Gly Asn Asn Gin Tyr His Arg Asn 

1505 1510 1515 1520 

TCG TGG ATT GGT TAC GTG AGG ATG TCT AGG GCA CGT ATT ACT GGG TTT 

Ser Trp He Gly Tyr Val Arg Met Ser Arg Ala Arg He Thr Gly Phe 
1525 1530 1535 

AAA CGT AAA CTG GTT GGC GAT GAA TCT GAG AAA GCT GCT GGT GAC GCA 

Lys Arg Lys Leu Val Gly Asp Glu Ser Glu Lys Ala Ala Gly Asp Ala 
1540 1545 1550 

AGC AGG GCT CAT AGA ACC AAT TTG ATC ATG GCT GAA ATC ATA CCC TGT 

^.e.r Arg Ala His Arg Thr Asn Leu He Met Ala Glu He He Pro Cys 
1S5S 1560 1565 

GCA ATT TAT GCA GCT GGT TGT TTT ATT GCC TTC ACG TTT ATT AAT GCT 

Ala Tie Tyr Ala Ala Gly Cys Phe lie Ala Phe Thr Phe He Asn Ala 
1570 1575 1580 

CAA ACC GGT GTC AAG ACT ACT GAT GAT GAT AGG GTG AAT TCT GTT TTA 

Gin Thr Gly Val Lys Thr Thr Asp Asp Asp Arg Val Asn Ser Val Leu 

1585 1590 1595 1600 



CCT ATC ATC ATT TGT ACC TTG GCG CCA ATC GCC GTT AAC CTC GGT GTT 
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Arg lie lie He Cys Thr Leu Ala Pro lie Aid Val Asn I.eu Gly Val 
1635 16:0 1615 

CTA TTC TTC TGT ATG GGT ATG TCA TGC TGC TCT GGT CCC TTA TTT GOT 4''^'-^n 

Leu Phe Phe Cys Met Gly Met Ser Cys Cys Set Giy Pro T.eu Phe Gly 
1620 162S 1630 

ATG TGT TGT AAG AAG AC A GGT TCT GTA ATG GCT GQA ATT GCC CAC GGT 49 44 

Met Cys Cys Lys Lys Thr Gly Ser Val Met Ala Gly Zle Ala His Giy 

1535 1640 :64S 

CTT CCT GTT ATT GTC CAC ATT GCC TTT TTC ATT GTC ATG TGG GTT TTG 4 9 92 

Val Ala Val He Val His He Ala Phe Phe He Val Met Trp Val Leu 
1650 1655 1660 

GAG AGC TTC AAC TTT GTT AGA ATG TTA ATC CCA GTC GTT ACT TGT ATC 5 04 0 

Glu Ser Phe Asn ?he Val Arg >1et Leu He Gly Val Val Thr Cys He 
IS 1665 1670 1675 1530 

CAA TGT CAA AGA CTC ATT TTT CAT TGC ATG ACA GCG TTA ATG TTG ACT 508 R 

Gin Cys Gin Arg Leu He Phe His Cys Met Thr Ala Leu Met Leu Thr 
16B5 1690 1695 
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CGT CAA TTT AAA AAC GAT CAT GCC AAT ACA GCC TTC TGG ACT GGT AAG 513 6 

Arg Glu Phe Lys Asn Asp His Ala Asn Thr Ala Phe Trp Thr Cly Lys 

1700 1705 1710 

TGG TAT GGT AAA GGT ATG GGT TAC ATG GCT TGG ACC CAG CCA ACT AGA 518^ 

Trp Tyr Gly Lys Gly Met Gly Tyr Met Ala Trp Thr Gin Pro Ser Arg I 

1715 1720 1725 ' 

GAA TTA ACC GCC AAG GTA ATT GAG CTT TCA GAA TTT GCA GCT GAT TTT 523 2 

Glu Leu Thr Ala Lys Val He Glu Leu Ser Glu Phe Ala Ala Asp Phe 
1730 1735 1740 

GTT CTA GGT CAT GTG ATT TTA ATC TGT CAA CTG CCA CTC ATT ATA ATC 5280 

Val Leu Gly His Val He Leu He Cys Gin Leu Pro Leu He He He 
30 1745 1750 1755 1760 

CCA AAA ATA GAT AAA TTC CAC TCG ATT ATG CTA TTC TGG CTA AAG CCC 5328 

Pro Lys He Asp Lys Phe His Ser lie Met Leu Phe Trp Leu Lys Pro 
1765 1770 1775 

TCT CGT CAA ATT CGT CCC CCA ATT TAC TCT CTG AAG CAA ACT CGT TTG 537 6 

35 Ser Arg 'Gin He Arg Pro Pro He Tyr Ser Leu Lys Gin Thr Arg Leu 

1780 1735 1790 

CGT AAG CGT ATG GTC AAG AAG TAC TGC TCT TTG TAC TTT TTA GTA TTG 54 2 4 

Arg Lys Arg Met Val Lys Lys Tyr Cys Ser Leu Tyr Phe Leu Val Leu 

1795 1800 1805 

GCT ATT TTT GCA GGA TGC ATT ATT GGT CCT GCT GTA GCC TCT GCT AAG 54 72 

Ala He Phe Ala Gly Cys He Tie Gly Pro Ala Val Ala Ser Ala Lys 
1810 1815 1820 

ATC CAC AAA CAC ATT GGA GAT TCA TTG GAT GGC GTT GTT CAC AAT CTA 5 52 0 

lie His Lys His He Gly Asp Ser Leu Asp Gly Val Val His Asn Leu 
4S 1325 1830 1835 1840 

TTC CAA CCA ATA AAT ACA ACC AAT AAT GAC ACT GGT TCC CAA ATG TCA 556 R 

Phe Gin Pro He Asn Thr Thr Asn Asn Asp Thr Gly Ser Gin Met Ser 
1845 1850 1855 

ACT TAT CAA AGT CAC TAC TAT ACT CAT ACG CCA TCA TTA AAG ACC TGG 5616 

SO Thr Tyr Gin Ser His Tyr Tyr Thr His Thr Pro Ser Leu Lys Thr Trp 
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1865 



1870 



TCA ACT ATA AAA 7AA 
Ser Tnr lie Lys 



bSJl 



(2) IMFOKKATION KOK Shiy ID NO : 2 : 

•i) SEOUENCE CHARACTERISTICS : 

(A) LENGTH: 1876 arr.ino acids 

(B) TYPE: aniao acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: proCein 

(xi) SEQUENCE DESCRIPTION: iiEQ ID NO : 2 : 

Met Arjn Thr Ar.p Cln Gin Pro Tyr Gin Cly Gin Thr Asp Tyr Thr Gin 
15 10 15 

Gly Pro Gly Asn Gly Gin Ser Gin Giu Gin Asp Tyr Asp Gin Tyr Gly 
20 25 30 

Gin Pro Leu Tyr Pro Ser Gin Ala Asp Gly Tyr Tyr Asp Pro Asn Val 
35 40 43 

Ma Ala Gly Thr Glu Ala Asp Met: Tyr Gly Gin Gin Pro Pro Asn Glu 
50 55 60 

2g Ser Tyr Asp Gin Asp Tyr Thr Asn Gly Glu Tyr Tyr Gly Gin Pro Pro 

65 70 75 fiO 

Asn Met Ala Ala Gin Asp Gly Glu Asn Phe Ser Asp Phe Ser Ser Tyr 
85 90 95 

Gly Pro Pro Gly Thr Pro Gly Tyr Asp Ser Tyr Gly Gly Cln Tyr Thr 
30 100 105 110 

Ala Ser Gin Met Ser Tyr Gly Gl'j Pro Asn Ser Ser Gly Thr Ser Thr 
115 120 125 
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Pro Tie Tyr Gly Asn Tyr Asp Pro Asn Ala He Ala Met Ala Leu Pro 
130 135 140 

Asn Glu Pro Tyr Pro Ala Trp Thr Ala Asp Ser Gin Ser Pro Val Ser 
14b 150 155 160 

He Glu Gin lie Glu Asp He Phe He Asp Leu Thr Asn Arg Leu Gly 
165 170 175 

Phe Gin Arg Asp Ser Met Arg Asn Met Phe Asp His Phe Met Val Leu 
180 185 190 

Leu Asp Ser Arg Ser Ser Arg Met Ser Pro Asp Gin Ala Leu Leu Ser 
195 200 205 

45 Leu His Ala Asp Tyr He Gly Gly Asp Thr Ala Asn Tyr Lys Lys Trp 

210 215 220 

Tyr Phe Ala Ala Gin Leu Asp Met Asp Asp Glu He Gly Phe Arg Asn 
225 230 235 240 

Mer, Ser Leu Gly Lys Leu Ser Arcj Lys Ala Arg Lys Ala Lys Lys Lys 
50 245 250 255 
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Asn Lys Lys Ala Met Glu Glu Ala Asn Pro Glu Asp Thr Glu Glu Thr 
260 265 270 

5 Leu Asn Lys lie Glu Gly Asp Asn Ser Leu Glu Ala Ala Asp Phe Arg 

27S 260 285 

Trp Lys Ala Lys Met: Asn Gin Leu Ser Pro Leu Glu Arg Val Arg His 
290 295 300 

^0 He Ala Leu Tyr Leu Leu Cys Trp Gly Glu Ala Asn Gin Val Arg Phe 

305 310 315 32C 

Thr Ala Glu Cys Leu Cys Phe lie Tyr Lys Cys Ala Leu Asp Tyr Leu 
325 330 335 

IS Asp Ser Pro Leu Cys Gin Gin Arg Gin Glu Pro Met Pro Glu Gly Asp 

340 345 350 

Phe Leu Asn Arg Val He Thr Pro He Tyr His Phe He Arg Asn Gin 
355 360 365 

20 Val Tyr Glu He Val Asp Gly Arg Phe Val Lys Arg Glu Arg Asp His 

370 375 380 

Asn Lys He Val Gly Tyr Asp Asp Leu Asn Gin Leu Phe Trp Tyr Pro 
385 390 395 40C 
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Glu Gly He Ala Lys He Val Leu Glu Asp Gly Thr Lys Leu He Glu 
405 410 415 

Leu Pro Leu Glu Glu Arg Tyr Leu Arg Leu Gly Asp Val Val Trp Asp 
420 425 430 

Asp Val Phe Phe Lys Thr Tyr Lys Glu Thr Arg Thr Tip Leu His Leu 
435 440 445 

Val Thr Asn Phe Asn Arg He Trp Val Met His He Ser He Phe Trp 
450 455 460 

Met Tyr Phe Ala Tyr Asn Ser Pro Thr Phe Tyr Thr His Asn Tyr Gin 
46b 470 475 480 

Gin Leu Val Asp Asn Gin Pro Leu Ala Ala Tyr Lys Trp Ala Ser Cys 
485 490 495 

Ala Leu Gly Gly Thr Val Ala Ser Leu He Gin He Val Ala Thr Leu 
500 SOS 510 

Cys Glu Trp Ser Phe Val Pro Arg Lys Trp Ala Gly Ala Gin His Leu 
515 520 525 

Ser Arg Arg Phe Trp Phe Leu Cys He He Phe Gly He Asn Leu Gly 
530 535 540 

Pro He He Phe Val Phe Ala Tyr Asp Lys Asp Thr Val Tyr Ser Thr 
545 550 555 560 

Ala Ala His Val Val Ala Ala Val Met Phe Phe Val Ala Val Ala Thr 
565 570 575 

lie He Phe Phe Ser lie Met Pro Leu Gly Gly Leu Phe Thr Ser Tyr 
580 585 590 

Met Lys Lys Ser Thr Arg Arg Tyr Val Ala Ser Gin Thr Phe Thr Aid 
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595 600 G05 

Ala Phe Ala Pro Leu His Gly Leu Asp Arg Trp Met Ser 'IV^" Leu Val 
610 615 620 

Trp Val Thr Val Phe Ala Ala Lys Tyr Ser Glu Ser Tyr Tyr Phe Leu 
625 630 63S 640 

Val Leu Ser Leu Arg Asp Pro lie Arg lie Leu Ser Thr Thr Ala Met 
645 650 555 

Arg Cys Thr Gly Glu Tyr Trp Trp Gly Ala Val Leu Cys Lys Val Gin 
660 665 670 

Pro Lys lie Val Leu Gly Leu Val lie Ala Thr Asp Phe lie Leu Phe 
675 680 685 

Phe Leu Asp Thr Tyr Leu Trp Tyr lie lie Val Asn Thr lie Phe Ser 
690 695 700 

Val Gly Lys Ser Phe Tyr Leu Gly lie Ser He Leu Thr Pro Trp Arg 
705 710 715 720 

Asn lie Phe Thr Arg Leu Pro Lys Arg Xle Tyr Ser Lys lie Leu Ala 
725 730 735 

Thr Thr Asp Met Glu He Lys Tyr Lys Pro Lys Val Leu lie Ser Gin 
740 745 750 

Val Trp Asn Ala He lie lie Ser Met Tyr Arg Glu His Leu Leu Ala 
755 760 765 

He Asp His Val Cln Lys Leu Leu Tyr His Gin Val Pro Ser Glu He 
770 775 780 

Glu Gly Lys Arg Thr Leu Arg Ala Pro Thr Phe Phe Val Ser Gin Asp 
735 790 795 flOO 

Asp Asn Asn Phe Glu Thr Glu Phe Phe Pro Arg Asp Ser Glu Ala Glu 
805 810 315 

Arg Arg He Ser Phe Phe Ala Gin Ser Leu Ser Thr Pro lie Pro Glu 
820 825 830 

Pre Leu Pro Val Asp Asn Met Pro Thr Phe Thr Val Leu Thr Pro His 
835 840 845 

Tyr Ala Glu Arg He Leu Leu Ser Leu Arg Glu He He Arg Glu Asp 
350 855 860 

Asp Gin Phe Ser Arg Val Thr Leu Leu Glu Tyr Leu Lys Gin Leu His 
B65 870 875 880 

Pro Val Glu Trp Glu Cys Phe Val Lys Asp Thr Lys He Leu Ala Glu 
885 890 895 

Glu Thr Ala Ala Tyr Glu Gly Asn Glu Asa Glu Ala Glu Lys Glu Asp 
900 905 910 

Ala Leu Lys Ser Gin He Asp Asp Leu Pro Phe Tyr Cys He Gly Phe 
915 920 925 

Lys Ser Ala Ala Pro Glu Tyr Thr Leu Arg Thr Arg He Trp Ala Ser 
930 935 940 
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Leu Arg Ser Gin Thr Leu Tyr Arg Thr lie Ser Gly Phe Met Asn Tyr 

945 950 955 960 

5 Ser Arg Ala lie Lys Leu Leu Tyr Arg Val Glu Asn Pro Glu lie Val 

965 970 975 

Gin Met Phe Gly Gly Asn Ala Glu Gly Leu Glu Arg Glu Leu Glu Lys 
980 985 990 

70 Met Ala Arg Arg Lys Phe Lys Phe Leu Val Ser Met Gin Arg Leu Ala 

995 1000 1005 

Lys Phe Lys Pro His Glu Leu Glu Asn Ala Glu Phe Leu Leu Arg Ala 
1010 1015 1020 
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Tyr Pro Asp [,eu Gin lie Ala Tyr Leu Asp Glu Glu Pro Pro Leu Thr 
1025 1030 1035 1040 

Glu Gly Glu Glu Pro Arg Tie Tyr 55er Ala Leu lie Asp Gly His Cys 
1045 1050 1055 

Glu lie Leu Asp Asn Gly Arg Arg Arg Pro Lys Phe Arg Val Gin Leu 

1060 1065 1070 

Ser Gly Asn Pro lie Leu Gly Asp Gly Lys Ser Asp Asn Gin Asn His 
1075 1080 1085 

Ala Leu lie Phe Tyr Arg Gly Glu Tyr lie Gin Leu He Asp Ala Asn 
1090 1095 1100 

Gin Asp Asn Tyr Leu Glu Glu Cys Leu Lys lie Arg Ser Val Leu Ala 
1105 1110 1115 1120 

Glu Phe Glu Glu Leu Asn Val Glu Gin Val Asn Pro Tyr Ala Pro Gly 
1125 1130 1135 

Leu Arg Tyr Glu Glu Gin Thr Thr Asn His Pro Val Ala He Val Gly 

1140 1145 1150 

Ala Arg Glu Tyr He Phe Ser Glu Asn Ser Gly Val Leu Gly Asp Val 
■35 1155 1160 1165 

Ala Ala Gly Lys Glu Gin Thr Phe Gly Thr Leu Phe Ala Arg Thr Leu 
1170 1175 1180 

Ser Gin He Gly Gly Lys Leu His Tyr Gly His Pro Asp Phe He Asn 
^0 1185 1190 1195 1200 

Ala Thr Phe Met Thr Thr Arg Gly Gly Val Ser Lys Ala Gin Lys Gly 
1205 1210 1215 

Leu His Leu Asn Glu Asp He Tyr Ala Gly Met Asn Ala Met Leu Arg 

45 1220 1225 123C 

Gly Gly Arg He Lys His Cys Glu Tyr Tyr Gin Cys Gly Lys Gly Arg 
1235 1240 1245 

Asp Leu Gly Phe Gly Thr He Leu Asn Phe Thr Thr Lys He Gly Ala 
SO 1250 1255 1260 

Gly Met Gly Glu Gin Met Leu Ser Arg Glu Tyr Tyr Tyr Leu Gly Thr 
1265 1270 1275 1280 

Gin Leu Pro Val Asp Arg Phe Leu Thr Phe Tyr Tyr Ala His Pro Gly 
55 1 2 8 5 1 2 9 0 1 2 9 5 
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Phe His Leu Asn Asn Leu Phe lie Gin Leu Ser Leu Gin Met Phe Met 
1300 1305 1310 

Leu Thr Leu Val Asn Leu Ser Ser Leu Ala His Glu Ser lie Met Cys 
1315 1320 1325 

:ie Tyr Asp Arg Asn Lys Pro Lys Thr Asp Val Leu Val Pre He Gly 
1330 1335 1340 

Cys Tyr Asr. Phe Gin Pro Ala Val Asp Trp Val Arg Arg Tyr Thr 
1345 1350 1355 1360 

Ser He Phe He Val Phe Trp lie Ala Phe Val Pro lie Val Val Gin 
1365 1370 1375 

Glu Leu He Glu Arg Gly Leu Trp Lys Ala Thr Gin Arg Phe Phe Cys 
1380 1385 1390 

His Leu Lgu Ser Leu Ser Pro Met Phe Glu Val Phe Ala Gly Gin He 
1395 1400 1405 

Tyr Ser Ser Ala Leu Leu Ser Asp Leu Ala He Gly Gly Ala Arg Tyr 
1410 1415 1420 

He Ser Thr Gly Arg Gly Phe Ala Thr Ser Arg He Pro Phe Ser He 
1425 1430 1435 1440 

Leu Tyr Ser Arg Phe Ala Gly Ser Ala He Tyr Met Gly Ala Arg Ser 
1445 1450 1455 

Met Leu Met Leu Leu Phe Gly Thr Val Ala His Trp Gin Ala Pro Leu 
1460 1465 1470 

Leu Trp Phe Trp Ala Ser Leu Ser Ser Leu He Phe Ala Pro Phe Val 
1475 1480 1485 

Phe Asn Pro His Gin Phe Ala Trp Glu Asp Phe Phe Leu Asp Tyr Arg 
1490 1495 1500 

Asp Tyr He Arg Trp Leu Ser Arg Gly Asn Asn Gin Tyr His Arg Asn 
1505 1510 1515 1520 

Ser Trp He Gly Tyr Val Arg Met Ser Arg Ala Arg He Thr Gly Phe 
3525 1530 1535 

Lys Arg Lys Lgu Val Gly Asp Glu Ser Glu Lys Ala Ala Gly Asp Ala 
1540 1545 1550 

Ser Arg Ala His Arg Thr Asn Leu He Met Ala Glu He He Pro Cys 
1555 1560 1565 

45 Ala He Tyr Ala Ala Gly Cys Phe He Ala Phe Thr Phe He Asn Ala 

1570 1575 1580 

GLn Thr Gly Val Lys Thr Thr Asp Asp Asp Arg Val Asn Ser Val Leu 
158^ 1590 1595 1600 

SO Arg He He He Cys Thr Leu Ala Pro He Ala Val Asn Leu Gly Val 

1605 1610 1615 

Leu Phe Phe Cys Met Gly Met Ser Cys Cys Ser Gly Pro Leu Phe Gly 
1620 1625 1630 

55 Met Cys Cys Lys Lys Thr Gly Ser Val Met Ala Gly He Ala His Gly 
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1635 1640 1645 

Val Ala Val lie Vai His lie Ala Phe Phe lie Val Met Trp Vai Leu 
1650 1655 1660 

Glu Ser Phe Asn Phe Val Arg Met Leu lie Gly Val Val Thr Cys He 
1665 1670 1675 1680 

Gin Cys Gin Arg Leu lie Phe His Cys Met Thr Ala Leu Met Leu Thr 
1685 1690 1695 

Arg Glu Phe Lys Asn Asp His Ala Asr. Thr Ala Phe Trp Thr Gly Lys 
1700 1705 1710 

Trp Tyr Gly Lys Gly Met Gly Tyr Met Ala Trp Thr Gin Pro Ser Arg 
1715 1720 1725 

Glu Leu Thr Ala Lys Val He Glu Leu Ser Glu Phe Ala Ala Asp Phe 
1730 1735 1740 

Val Leu Gly His Val He Leu He Cys Gin Leu Pro Leu He He He 
1745 1750 1755 1760 

Pro Lys He Asp Lys Phe His Ser He Met Leu Phe Trp Leu Lys Pro 
1765 1770 1775 

25 Ser Arg Gin He Arg Pro Pro He Tyr Ser Leu Lys Gin Thr Arg Leu 

1780 1785 1790 

Arg Lys Arg Met Val Lys Lys Tyr Cys Ser Leu Tyr Phe Leu Val Leu 
1795 1800 1805 

30 Ala He Phe Ala Gly Cys He He Gly Pro Ala Val Ala Ser Ala Lys 

1810 1815 1820 

He His Lys His He Gly Asp Ser Leu Asp Gly Val Val His Asn Leu 
1825 1830 1835 1840 
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Phe Gin Pro He Asn Thr Thr Asn Asn Asp Thr Gly Ser Gin Met Ser 
1845 1850 1855 

Thr Tyr Gin Ser His Tyr Tyr Thr His Thr Pro Ser Leu Lys Thr Trp 
1860 1865 1870 

Ser Thr He Lys 
1875 
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Claims 

1. A substantially pure ECB binding peptide connprising at least 46 contiguous amino acid residues of SEQ ID NO:2. 

2. A substantially pure ECB binding peptide, as in Claim 1 comprising the amino acid sequence defined by residues 
605 to 650 of SEQIDNO:2. 

3. An isolated nucleic acid compound encoding a peptide of Claim 1 or Claim 2. 

4. An isolated nucleic acid encoding a peptide of Claim 1 wherein said nucleic acid has a sequence selected from 
the group consisting of: 
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(a) (a) residues 1747 to 2016 of SEQ ID NO I; or 

(b) a nucleic acid compound complemGntary to (a). 

A vector comprising an isolated nucleic acid compound of Claim 3. 
A host cell containing a vector of Claim 5. 

A method for constructing a recombinant host cell having the potential to express an ECB binding domain of SEQ 
ID NO:2. said method comprising introducing into said host cell by any suitable means a vector of Claim 5. 

A method for expressing an ECB binding domain of SEQ ID NO:2 in the recombinant host cell of Claim 7. said 
method comprising culturing said recombinant host cell under conditions suitable for gene expression. 

A method for identifying compounds that bind an ECB binding domain, comprising the steps of; 

a) admixing in a suitable reaction buffer 

i) a substantially pure ECB binding peptide, as claimed in Claim 1 ; and 

ii) a test inhibitory compound; 

b) measuring by any suitable means a binding between said peptide and said compound. 
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